Analysis method and algorithm design of biological sequence problem based on generalized k-mer vector

Abstract

K-mers can be used to describe biological sequences, and the k-mer distribution is a tool for solving sequence analysis problems in bioinformatics. We use the k-mer vector as a representation of the k-mer distribution of a biological sequence. Problems such as similarity calculation or sequence assembly can be described in the k-mer vector space. This helps us identify new features of old sequence-based problems in bioinformatics and develop new algorithms using concepts and methods from linear space theory. In this study, we define the k-mer vector space for generalized biological sequences. The meaning of the corresponding vector operations is explained in the biological context. We present the vector/matrix form of several widely seen sequence-based problems, including the read quantification problem, the sequence assembly problem, and the pattern detection problem. Its advantages and disadvantages are discussed. We also implement a tool for the sequence assembly problem based on the k-mer vector methods. It shows the practicability and convenience of this algorithm design strategy.
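As a rough illustration of the k-mer vector idea described in the abstract, the following Python sketch builds a k-mer count vector for a DNA string and compares two sequences by cosine similarity. It is not the authors' implementation: the function names `kmer_vector` and `cosine_similarity`, the fixed `ACGT` alphabet, and the choice of cosine similarity as the measure are all assumptions made for this example.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_vector(seq, k, alphabet="ACGT"):
    """Return the k-mer count vector of seq (hypothetical helper).

    Coordinates follow the lexicographic order of all len(alphabet)**k
    k-mers; k-mers containing symbols outside the alphabet are ignored.
    """
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(p)] for p in product(alphabet, repeat=k)]


def cosine_similarity(u, v):
    """Cosine of the angle between two k-mer vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    x = kmer_vector("ACGTACGT", 2)
    y = kmer_vector("ACGTTTTT", 2)
    print(cosine_similarity(x, y))
```

Under this representation, a sequence of length n contributes n - k + 1 counts in total, so sequences of different lengths live in the same 4^k-dimensional space and can be compared directly.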

Similar resources

On Vector Equilibrium Problem with Generalized Pseudomonotonicity

In this paper, first a short history of the notion of the equilibrium problem in economics and Nash's game theory is stated. The relationship between the equilibrium problem and important mathematical problems such as the optimization problem, nonlinear programming, the variational inequality problem, the fixed point problem and the complementarity problem is also given. The concept of generalized pseudomonoton...

Space-efficient K-MER algorithm for generalized suffix tree

Suffix trees have proven very fast for pattern searching, yielding O(m) time, where m is the pattern size. Unfortunately, their high memory requirements make them impractical for huge amounts of data. We present a memory-efficient algorithm for a generalized suffix tree which reduces the space size by a factor of 10 when the size of the pattern is known beforehand. Experiments on th...

Statistics for K-mer Based Splicing Analysis

It is well acknowledged that the alternative splicing module plays a crucial role in identifying variations in RNA transcriptomes. In high-throughput short-read RNA, splicing analysis is a challenging task due to the uncertainty and time complexity of read alignments onto the genome and transcriptome. In this paper, we introduce a k-mer based statistical method for splicing event analysis. The k-me...

Software Vulnerability Analysis Method Based on Adaptive-K Sequence Clustering

Software vulnerability analysis has become a hot topic recently. However, the traditional methods for analyzing software vulnerability have a higher false positive rate. In this paper, an adaptive K function is defined, and SVAAKSC (Software vulnerability analysis method based on adaptive-K sequence clustering) is presented. The collected objects in software vulnerability sequence database SVSD are ...

Fabrication of new ion sensitive field effect transistors (ISFET) based on modification of junction-FET for analysis of hydronium, potassium and hydrazinium ions

A novel and ultra low cost ISFET electrode and measurement system was designed for ISFET application and detection of hydronium, hydrazinium and potassium ions. Also, a measuring setup containing appropriate circuits, suitable analyzer (Advantech board), de noise reduction elements, cooling system and PC was used for controlling the ISFET electrode and various characteristic measurements. The t...

Journal

Journal title: Applied Mathematics-a Journal of Chinese Universities Series B

Year: 2021

ISSN: 1005-1031, 1993-0445, 1000-4424

DOI: https://doi.org/10.1007/s11766-021-4033-x